home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
Trading on the Edge
/
Trading On The Edge - CD-ROM Toolkit (Wayzata Technology)(2031)(1994).bin
/
pc
/
pc_files
/
venddemo
/
visanlst
/
t002
< prev
next >
Wrap
Text File
|
1993-01-06
|
9KB
|
217 lines
.LsnName AUTO Data & Control Files
.LsnDesc This set of screens shows the initial data,
the basic Information Harvesting control file, IH1CNTL,
and several of the initial parameter files.
.LsnScts Initial Cars Data, IH1CNTL, IH1LIM, IH2TRND, IH3NBIN, IH3CBIN
.LsnPrms Estimation data glosaadt
.Initial ~c~wThe flat file input data to the Information Harvesting modules ~
may be formatted with delimiters, such as commas or tabs, or may be in ~
fixed columns. ~
The CARS data is comma delimited and begins with one title line. ~
Following are the first rows of the CARS data.
~b
Mi/Gal,Cyl,Cu/In,Hpwr,Wt/Lbs,Acc/0-60,Year,Origin,Brand,Model
18.0,8,307,130,3504,12,1971,US,chevrolet,chevrolet_chevelle malibu
15.0,8,350,165,3693,12,1971,US,buick,buick_skylark 320
18.0,8,318,150,3436,11,1971,US,plymouth,plymouth_satellite
16.0,8,304,150,3433,12,1971,US,amc,amc_rebel sst
17.0,8,302,140,3449,11,1971,US,ford,ford_torino
15.0,8,429,198,4341,10,1971,US,ford,ford_galaxie 500
14.0,8,454,220,4354, 9,1971,US,chevrolet,chevrolet_impala
14.0,8,440,215,4312, 9,1971,US,plymouth,plymouth_fury iii
14.0,8,455,225,4425,10,1971,US,pontiac,pontiac_catalina
15.0,8,390,190,3850, 9,1971,US,amc,amc_ambassador dpl
15.0,8,383,170,3563,10,1971,US,dodge,dodge_challenger se
14.0,8,340,160,3609, 8,1971,US,plymouth,plymouth_'cuda 340
15.0,8,400,150,3761,10,1971,US,chevrolet,chevrolet_monte carlo
14.0,8,455,225,3086,10,1971,US,buick,buick_estate wagon (sw)
24.0,4,113, 95,2372,15,1971,Japan,toyota,toyota_corona mark ii
22.0,6,198, 95,2833,16,1971,US,plymouth,plymouth_duster
18.0,6,199, 97,2774,16,1971,US,amc,amc_hornet
21.0,6,200, 85,2587,16,1971,US,ford,ford_maverick
27.0,4, 97, 88,2130,15,1971,Japan,datsun,datsun_pl510
26.0,4, 97, 46,1835,21,1971,Europe,vw,vw_1131 deluxe sedan
25.0,4,110, 87,2672,18,1971,Europe,peugeot,peugeot_504
24.0,4,107, 90,2430,15,1971,Europe,audi,audi_100 ls
25.0,4,104, 95,2375,18,1971,Europe,saab,saab_99e
26.0,4,121,113,2234,13,1971,Europe,bmw,bmw_2002
21.0,6,199, 90,2648,15,1971,US,amc,amc_gremlin
10.0,8,360,215,4615,14,1971,US,ford,ford_f250
10.0,8,307,200,4376,15,1971,US,chevrolet,chevrolet_c20
11.0,8,318,210,4382,14,1971,US,dodge,dodge_d200
9.0,8,304,193,4732,19,1971,US,hi,hi_1200d
27.0,4, 97, 88,2130,15,1972,Japan,datsun,datsun_pl510
28.0,4,140, 90,2264,16,1972,US,chevrolet,chevrolet_vega 2300
25.0,4,113, 95,2228,14,1972,Japan,toyota,toyota_corona
19.0,6,232,100,2634,13,1972,US,amc,amc_gremlin
.IH1CNTL
~c~wThe derivation of the Information Harvesting rules is controlled ~
by the IH1CNTL control file. ~
The meanings of the parameters are described in the User's Manual. ~
The last few lines describe the properties of the data variables.
~b
title Automobile Acceleration
menu_title Auto:/Accel
AI_path \psg\workih\glosaa
AI_ex_num 2
data_strm_cnt 1
line_skip_cnt 1
data_file_names \ihdata\cars\cars.dat
bit_file_name cars.bit
pred_var_name TM_ACC
id_var_limit TM_ACC . 22
trnd_var_name YEAR
trnd_var_name ORIGIN
min_pred_diff 0.05
min_row_cnt 5
min_catg_cnt 1
pred_lvl_cnt 4
bin_skew .5
max_qual_vlty .7
tgt_bin_rl_cnt 10
ascii_rules 0
var_cnt 9
MPG Miles per gallon I D , 0 - N - + 6
CYL Cylinders I D , +1 - C - - 6
CU_IN Cubic Inches I D , +1 - N - * 6
HPWR Horsepower I D , +1 - N - * 6
WT_LBS Weight (lbs) I D , +1 - N - + 6
TM_ACC Time Accel 0-60 X D , +1 - N - - 12
YEAR Year I D , +1 - C - - 6
ORIGIN Origin X D , +1 - C - - 5
MAKE Make I D , +1 - C - - 6
.IH1LIM
~c~wThe first Information Harvest module, IH1, scans the data to find ~
the counts of nonNULL values, the average value and the extreme values ~
for each variable.
~b
rec_cnt 393
MPG 388 23.33 7.696 9.000 46.60 9.000 41.00 0 0
CYL 388 1 3 4 6 8
CU_IN 388 195.3 104.6 68.00 455.0 68.00 455.0 0 0
HPWR 388 105.0 38.31 46.00 230.0 46.00 182.0 0 0
WT_LBS 388 2983 850.1 1613 5140 1613 5034 0 0
TM_ACC 388 15.59 2.621 8.000 22.00 8.000 22.00 0 0
YEAR 388 4 1971 1972 1982 1983
ORIGIN 388 6 Europe Japan Japan US
MAKE 388 10 amc audi volvo vw
total data chars: 21189
.IH2TRND
~c~wIf one or more trnd_var_name variables are selected in IH1CNTL, ~
the module, IH2, will find either arithmetic or ~
geometric averages of specified variables in order to remove time, ~
departmental, or other trends.~b~m 0 0 5 0
trnd_cnt 13~m 0 0 0 0
trnd_catg 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983
MPG -12.61 15.94 18.00 21.00 18.15 17.27 22.73 20.57 21.68 23.07
23.46 25.15 31.28 30.21 31.71
CU_IN 52.39 469.0 1.422 1.082 1.159 1.298 0.8716 1.075 1.032 0.9667
0.9826 1.065 0.6740 0.7463 0.7371
HPWR 39.05 179.8 1.140 1.061 1.198 1.178 0.9430 1.021 1.001 1.006
1.025 1.016 0.8077 0.8305 0.8519
WT_LBS -1527 2000 3331 2955 3342 3394 2884 3150 3066 2995 2897 3011
2459 2534 2439 ~m 0 0 5 0
trnd_cnt 3 ~m 0 0 0 0
trnd_catg Europe Japan US
MPG -12.27 15.14 4.032 4.684 -2.505
CU_IN 63.99 375.3 0.6307 0.6641 1.287
HPWR 46.79 189.6 0.8271 0.8620 1.110
WT_LBS -1561 1654 -569.1 -594.4 345.4 ~m 0 0 5 0
MPG -10.00 10.00 ~m 0 0 0 0
CU_IN 60.00 400.0
HPWR 40.00 160.0
WT_LBS -1300 1400
.IH3NBIN
~c~wIH3 accumulates information about the distributions of the values for ~
the variables described in IH1CNTL. ~
Then, IH3 proposes bin boundaries based on the observed distributions. ~
IH3NBIN contains proposed boundaries for the numerical varibles.
~b~m 0 0 4 4
MPG 6 -12.27 -5.714 -3.016 -0.6349 2.857 6.190 15.14
CU_IN 6 60.00 103.2 132.9 178.7 216.5 257.0 400.0
HPWR 6 40.00 68.57 81.90 96.19 116.2 136.2 189.6
WT_LBS 6 -1561 -892.9 -442.9 -14.29 414.3 864.3 1654
TM_ACC 13 8.000 10.00 11.00 12.00 13.00 14.00 15.00 16.00
17.00 18.00 19.00 20.00 21.00 22.00
.IH3CBIN
~c~wIH3CBIN is also produced by IH3. ~
This file shows the proposed binning for the categorical variables. ~
The first four columns list the category names alphabetically. ~
The last four columns list the same information in order of the average ~
value of the predicted variable (TM_ACC, the time to accelerate from 0 to 60) ~
for each category value. ~
The second column is the proposed bin number. ~
the third column is the count of the number of occurences of the category.
~b
CYL
5 1 5 0
3 1 4 13.75 3 1 4 13.75
4 2 195 16.55 4 2 195 16.55
5 3 3 18.67 5 3 3 18.67
6 4 83 16.40 6 4 83 16.40
8 5 103 13.10 8 5 103 13.10
YEAR
13 4 6 0
1971 1 29 13.21 1971 1 29 13.21
1972 3 27 15.26 1974 2 40 14.55
1973 2 27 15.07 1973 2 27 15.07
1974 2 40 14.55 1972 3 27 15.26
1975 6 26 16.42 1980 3 28 15.36
1976 4 30 16.27 1978 3 28 15.57
1977 4 34 16.00 1979 4 36 15.89
1978 3 28 15.57 1977 4 34 16.00
1979 4 36 15.89 1976 4 30 16.27
1980 3 28 15.36 1983 5 29 16.31
1981 6 26 16.85 1982 5 28 16.32
1982 5 28 16.32 1975 6 26 16.42
1983 5 29 16.31 1981 6 26 16.85
ORIGIN
0 0 0 0
MAKE
30 10 6 0
amc 3 27 15.22 bmw 1 2 13.00
audi 4 7 16.14 chrysler 1 6 13.50
bmw 1 2 13.00 pontiac 1 16 14.13
buick 2 17 14.76 cadillac 1 2 14.50
cadillac 1 2 14.50 dodge 2 28 14.61
capri 2 1 15.00 buick 2 17 14.76
chevrolet 4 47 15.49 capri 2 1 15.00
chrysler 1 6 13.50 nissan 2 1 15.00
datsun 6 23 16.48 mercury 2 11 15.00
dodge 2 28 14.61 triumph 2 1 15.00
fiat 4 8 16.00 plymouth 3 31 15.03
ford 3 48 15.44 amc 3 27 15.22
hi 6 1 19.00 ford 3 48 15.44
honda 4 13 16.08 chevrolet 4 47 15.49
mazda 5 12 16.25 opel 4 4 15.50
mercedes 6 3 19.67 saab 4 4 15.50
mercury 2 11 15.00 fiat 4 8 16.00
nissan 2 1 15.00 volvo 4 6 16.00
oldsmobile 5 10 16.20 honda 4 13 16.08
opel 4 4 15.50 audi 4 7 16.14
peugeot 6 7 19.00 toyota 5 26 16.19
plymouth 3 31 15.03 oldsmobile 5 10 16.20
pontiac 1 16 14.13 mazda 5 12 16.25
renault 6 3 17.33 vw 5 19 16.47
saab 4 4 15.50 datsun 6 23 16.48
subaru 6 4 17.00 subaru 6 4 17.00
toyota 5 26 16.19 renault 6 3 17.33
triumph 2 1 15.00 peugeot 6 7 19.00
volvo 4 6 16.00 hi 6 1 19.00
vw 5 19 16.47 mercedes 6 3 19.67